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Abstract 

We investigate the nature of genetic drift acting at the leading edge of range expansions, 
building on recent results in [Hallatschek et al., Proc. Natl. Acad. Sci., 104(50): 19926 - 19930 
(2007)]. A well mixed population of two fluorescently labeled microbial species is grown in 
a circular geometry. As the population expands, a coarsening process driven by genetic drift 
gives rise to sectoring patterns with fractal boundaries, which show a non-trivial asymptotic 
distribution. Using simplified lattice based Monte Carlo simulations as a generic caricature 
of the above experiment, we present detailed numerical results to establish a model for sector 
boundaries as time changed Brownian motions. This is used to derive a general one-to-one 
mapping of sector statistics between circular and linear geometries, which leads to a full un- 
derstanding of the sectoring patterns in terms of annihilating diffusions. 

keywords: genetic drift; range expansion; domain coarsening; time-changed Brownian mo- 
tion; annihilating random walks 

1 Introduction 

Genetic drift is one of several evolutionary processes which lead to changes in allele frequencies 
in populations over time. As opposed to natural selection, it is driven purely by unspecific random 
sampling and chance events, and is now widely accepted to be a major evolutionary force (see HI 

2 J and references therein). In addition to selective pressure, genetic drift leads to variations in allele 
frequencies. In large populations the variations are typically small compared to the population size, 
and these systems are dominated by the law of large numbers. But in small populations genetic 
variation and fluctuations in allele frequencies can have a significant impact [ 3 ] and may even lead 
to speciation [1]. For some species such as humans it has even been shown to be the primary force 
responsible for a particular feature EHH. Range expansions are frequently seen in the history of 
many species [6] including humans [7], when they encompass new areas to draw upon greater 
resources in the newly found habitats or driven by changing environmental conditions. Climatic 
changes trigger episodes of range contractions and expansions in many plant and animal species. 
For example, in recent decades, the increase in global temperatures has lead to pole-ward range 
extensions for a variety of species [8]. Range expansions leave a distinct pattern in the genetic 
structure of a population, which is found to be a reduction in diversity and an increased linkage 
between genetic markers 0. This enables historical patterns of range expansions to be traced Q. 
A special case of genetic drift and range expansion is known as the founder effect. When a small 
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Figure 1: Growth patterns of green and red cells grown from a well mixed circular initial population of 
about 10 6 cells. Left: Fluorescent image of real experimental data [2] for flourescently labeled E. coli 
bacteria, taken after 84 hours of growth. Right: Data of a single Monte Carlo sample of the lattice based 
growth model (see Section 2) of comparable inital population size. Both share the same characteristic 
coarsening patterns. 

subset of the population breaks away and forms a new colony, the founders have strong effects 
on the newly created population's genetic make-up. Through random sampling it is expected that 
there will be differences in allele frequencies between the original population and the colony, and 
over the course of many generations this genetic difference will amplify [9]. For example the 
expansion of humans from a small population in Africa to the whole world has lead to a decrease 
in heterozygosity in the population fl4ll6ll7l [T0ll . 

The motivation for this paper are results in 0, where the connection between range expan- 
sions and random genetic drift is analysed in a simplified experimental setting for populations of 
microbial systems. A well mixed population of two fluorescently labeled E. coli strains is placed 
at the center of an agar plate containing rich growth medium. Apart from the flourescent label the 
bacteria are genetically identical. Fig.[T](left) shows a fluorescent image of the bacterial population 
at the end of a growth period of four days. Compactness within the initial habitat and the use of im- 
motile E. coli leads to the population evolving only at the leading edge. As the colony grows, cells 
that loose contact to the colonization front can no longer participate in the colonization process. 
So the reproducing subset is a small fraction of the total population, chance effects are enhanced 
and provide the major force for gene segregation via genetic drift. From the initially well mixed 
population, sectors of single gene alleles emerge, which can either expand or loose contact to the 
population front. This drives a coarsening process, which leads to a gradual decrease in the number 
of sectors as a consequence of genetic drift acting at the leading edge of the range expansion. The 
sectors surviving this annihilation process grow in size at the cost of those left behind, and appear 
to surf on the wave front JUQj]]. Since the experiment uses immotile E. coli on hard agar, there 
is no noticable change in the population behind the expanding front. Therefore Fig.[T]provides a 
frozen record of the coarsening process during colonization. The domain coarsening process can 
be understood through the fluctuating path of the boundaries separating neighbouring sectors. A 
domain is frozen out when the leading ends of its boundaries meet and annihilate. A quantitative 
model and analysis of sector boundaries is presented in |[T2l[T3l . where the assumption was made 
that the fluctuations of the motion taken by the leading tip of the domain boundary are diffusive. 
However, the authors also discuss that this idealization is not quite correct, since already in Q 
the mean squared displacement of the sector boundaries was estimated from experimental data 
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to grow superdiffusively. This superdiffusive behaviour can be attributed to the roughness of the 
population front, following works on competition interfaces [14] for rough surfaces in the KPZ 
universality class 1031, 

In this paper, we incorporate the superdiffusivity of the sector boundaries and adapt the model 
introduced in |[T2l . Based on ideas in that paper, we present a new approach to separate the 
domain coarsening due to genetic drift and the domain growth due to range expansion. Under 
an appropriate mapping the circular growth experiment can be understood in terms of annihilat- 
ing one-dimensional diffusions in a generic linear geometry. This process has been well studied 
mathematically (see e.g. |[T6l[T7l and references therein) and serves as a prototype for the coars- 
ening dynamics resulting from genetic drift. The geometric influence due to range expansions is 
completely summarized by the mapping. We argue that this approach is applicable in various sit- 
uations, and provides a general understanding of the competing effects of genetic drift and range 
expansions on population fronts. To corroborate our results with good numerical precision, we 
divise a lattice based spatial stochastic process similar to |[T3l reproducing the basic features of 
the experiments in 0, and use Monte Carlo simulations for sampling. We would like to stress 
that the purpose of this model and the whole paper is not an accurate quantitative reproduction of 
experimental data from [2], but a contribution to the fundamental understanding of the effects of 
genetic drift on expanding population fronts as a generic emergent phenomenon for a large class 
of models. 

In the next section we provide a description of the model and simulation methods, followed 
by a detailed numerical analysis of the sector boundaries in Section 3, which leads to an effective 
mathematical description of their fluctuations. This is used in Section 4 to understand the domain 
coarsening under range expansions via a mapping to a generic linear geometry, which is our main 
result. In Section 5 we summarize and discuss the applicability of our approach in more general 
situations. 

2 Model and simulation method 

A continuous time Markov chain model on a two-dimensional square lattice is used to replicate the 
basic features of the experiments performed in Species labeled A and B are used to represent 
the two distinctly coloured E. coli strains. As the strains are otherwise genetically identical, the 
species A and B reproduce with the same rates ta = re> which we take equal to one. As seen in the 
experiment [2 J, the growth region has a finite width and we approximate this simply by allowing 
only those cells to reproduce which occupy a position at the colony front. Any descendant of a cell 
inherits the type (A or B) from its parent and is placed with equal probability on one of the free 
neighbouring sites. When a cell loses contact to the boundary, it is dropped out of the reproducing 
subset and becomes part of the frozen interior of the colony. As the expansion increases these cells 
provide a traceable record of the random chance events that have taken place, which enables us to 
classify the ancestors of the current reproducing subpopulation. 

This process is simulated using a Monte Carlo algorithm similar to a model in [13] with random 
sequential update, which corresponds to the simulation of the jump chain of the Markov model 
(see e.g. |[T8ll . Section 2.2). In each time step, one cell is chosen uniformly at random from the 
reproducing set E(t) with N(t) = \E(t)\ members. Then a cell of the same type is put on one 
of the neighbouring empty sites which is also chosen uniformly at random. The time counter 
t i — y t ~t~ Exp(N(t)) is updated by adding an exponential random variable with mean l/N(t) 
which is the waiting time of the Markov chain. As a last step the reproducing set E(t) is updated 
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according to the growth event that just happened. Depending on the geometry the new cell can 
become a member of E(t), and others may have to be removed if they lost their only empty 
neighbouring site. For large sizes of E(t) the waiting time is typically approximated by its mean 
value 1/N(t), which gives a very good approximation of the continuous time Markov chain model. 
We model the growth experiment [2] on the agar plate by initializing an approximately circular 
subset of the square lattice of radius ro with A and B cells independently with probabilities 1/2, 
leading to a uniform mixture. The initial reproducing set E(0) is then given by the boundary of that 
circular set, containing of the order of 2-7rro cells. In Figure [T] we compare a single sample of the 
Monte Carlo simulation with the experimental picture from |2] where irr% » 10 6 (i.e. r 564), 
and both show very good qualitative agreement. This suggests that our Markovian model provides 
a good reproduction of the basic features of the experiment on a large scale, more quantitative 
evidence will be given later. This is quite remarkable, since we have ignored all microscopic details 
of the reproduction mode of E. coli which is clearly not spatially homogeneous or Markovian (see 
and references therein). This is a strong indication that the observed segregation is an emergent 
phenomenon which is to a large extent independent of microscopic details of reproduction, as is 
discussed in more detail in Section 5. 

In our model the size of the population front is basically given by the circumference and 
grows linearly with the radius. In Section 4 we will show that all statistical properties of this 
circular expanding process can be fully understood by mapping it onto a linear geometry where the 
population front has a fixed width. To simulate this we take a strip {1, . . . , w} x Z + of finite width 
w G Z + with periodic boundary conditions in x-direction. At height h = we initialize the lattice 
with A and B cells independently with equal probabilities and then apply our simulation algorithm 
with the restriction of growing only on sites with positive height h > 0. The initially flat surface 
will grow in height with local fluctuations building up (see Figure 2), and we call this the linear 
model/simulation in the following. The sector boundaries resemble a set of annihilating random 
walks, which leads to a coarsening of the sectors of A and B species analogous to the circular 
experiment (see Figure 2, right). But due to the finite width w eventually all the boundaries will 
annihilate and only one sector remains (see Figure 2 left), which is called fixation. The linear 
geometry has also been realized in a real experiment [2], by touching the Petri dish with a sterile 
razor blade that was previously wetted by a liquid mixed culture of flourescently labeled cells. 

3 First results on a single boundary 

To understand the coarsening behaviour of the genetic domains, we will first study the fluctuations 
of a single sector boundary and its connection to the roughness of the growing frontier in the 
linear geometry. We remark that the first results in this section are well known within statistical 
mechanics. We include a short exposition for completeness and refer the reader to fl9ll20ll2"Tl[l4l 
for details. 

3.1 Roughness of the frontier 

If we only consider occupied and unoccupied sites and neglect the species, the growth of the 
population in our lattice based model is well known as the Eden model of type C fl9l |21X It is 
also known, that the surface of Eden clusters belongs to the KPZ universality class ll20l[T5l . which 
implies fluctuations of the surface height to scale with an exponent 1/3 as explained below. At 
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Figure 2: Sample pictures of the simulation modeling the linear colony experiment, where sites at height 
h = are initialised with an A or B cell with equal probability. Fluctuating sector boundaries annihilate 
when they meet, driving the domain coarsening of allele sectors. The height of the growing frontier is seen 
to fluctuate as well. Left: For w — 100 the coarsening of sectors leads to fixation at height h « 350, with 
only one remaining sector (green in the example shown). Right: For w = 1000 the width of the strip is too 
large to reach fixation before the simulation was stopped at height h s» 1200. 



any given time, we denote by 

v ' ieE(t) 

the average surface height, where the sum is carried out over heights hi (t) of the sites in the re- 
producing boundary E{t), containing N(t) cells. The fluctuations are measured by the roughness, 
which is defined as the standard deviation of the surface height, 



S(h) 



V ' i£E(t) 



(1) 



Since h(t) is typically increasing proportional to time t, we can see S(h) as a function of the 
average height, which we will use as a time-like parametrization in the following. Results for the 
KPZ universality class predict the following behaviour for the function S(h), where write simply 
h for its argument from now on. When the surface is grown with periodic boundary conditions on 
a lattice of width w (see e.g. |[T9ll ) we have 

, , J h 13 for h^w z 

1 >~\w a for/i>to 2 ' K) 

where the exponents are given by a = 1/2, /3 = 1/3 and z = aj (3 = 3/2 lil5l . So for small 
heights h the roughness S(h) grows like a power law, and saturates for large h to a stationary value 
depending on the width w of the strip. The crossover occurs at a scale h ~ u> 3 / 2 and the theory 
predicts further, that under rescaled coordinates h/w 3 / 2 the normalized roughness S{h)/w 1 / 2 
follows a universal scaling function for all values of w. This is confirmed in Figure [3] where we 
show data for S(h) averaged over 500 realizations (indicated by (..)) in a double logarithmic plot. 
After rescaling we see a data collapse for different values of w, and the limiting behaviour ([2]) for 
small and large h is indicated by straight lines which fit the data well. This result is of course not 
new, but confirms the arguments in El [TJ1 by independent simulations, that the KPZ universality 
class is indeed the correct framework to study this kind of space limited population growth. 
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Figure 3: Double logarighmic plot of surface roughness S(h) as given in for three different width 
values w = 300, w — 400 and w — 500. (..) denotes an average over 500 measurements, data are binned 
logarithmically to improve the visualization for large values of h. The data collapse confirms the scaling 
prediction Q of the KPZ universality class for the roughness of the population frontier. The straight lines 
indicate the initial growth with power 1/3 and saturation at a constant value. 

3.2 Single sector boundaries 

As explained in the Introduction, the superdiffusive behaviour of domain boundaries can be at- 
tributed to the roughness of the interface G1[T4). The roughness affects the fluctuations of the 
sector boundaries, since locally they grow perpendicular to the population front. This leads to an 
increase in the fluctuations, and to the mean squared displacement of the boundaries growing as 
/i 4 / 3 with height El. It was noted in [2] that this theoretical value coincides well with prelimi- 
nary measurements of phase boundary fluctuations from real experimental data. In the following 
we present simulation data confirming these results, and use them to establish a model for phase 
boundaries which generalizes the purely diffusive approach in lfT2l[T3ll . 

We simulate the linear colonial model with the special initial condition of having only species 
A for x G {1, ... , w/2 — 1} and only species B for x G {w/2, . . . , w}, where w is even. Due to 
periodic boundary conditions, this corresponds to two sector boundaries located initially between 
sites w/2 — 1, w/2 and w, 1. We choose a large width w = 1000 and stop the simulation at 
time t = 1000 well before the fixation time for this system, which is of order w 3 ^ 2 as explained 
below. Therefore the sector boundaries stay well separated throughout the simulation and can be 
considered to be independent. The path of a sector boundary is composed of single step motions 
of the tip, which can move up (U), down (D), left (L) and right (R). We analyze 1000 samples of 
sector boundaries, and our first observation is that the proportion of down steps ttd of the tip is 
very small for all the samples, in summary we find 

(ttjj) = 0.671 , (tt d ) = 0.005 , (tt l ) = 0.163 and (ttr) = 0.161 . (3) 

Therefore we can effectively describe the boundaries by one-dimensional lateral displacements 
Xh as a function of the height h, simply by cancelling rare down steps of the boundary tip with 
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Figure 4: Measurements of the mean squared displacement MSD(h) for the sector boundaries The 
full lines indicate that for small h the behaviour is diffusive MSD(h) s» 0.9 h (red) due to the fiat initial 
conditions, while the best fit to the data (blue) of the form 6ft 4 / 3 is b — 0.344. This describes the data very 
well in accordance with the theoretical prediction 1 14 1. 



subsequent up steps. As expected, the probability of moving left or right is approximately equal, 
which leads to the boundaries following paths of one dimensional symmetric random walkers with 
(Xh) = 0. The fluctuations of such paths are determined by the mean square displacement 

MSD(h) = (X 2 h ) , (4) 

which is shown in Fig.|4]in a double logarithmic plot. Following |[T4l . the KPZ behaviour S(h) ~ 
/i 1 / 3 of the surface roughness implies the scaling prediction MSD(h) ~ /i 4 / 3 for the wandering 
exponent of the sector boundary, which is also in accordance with the real data in [2]. This is 
confirmed from our simulations in Fig. [4] where we show the best fit 

MSD(h) « bh 4/3 with b = 0.344 , (5) 

and a line proportional to h for comparison. The second one assuming diffusive fluctuations 
clearly does not explain the data well except for very small heights, since our simulations start 
with a flat frontier of zero roughness. The fit with fixed exponent 4/3 works well for the bulk of 
the data. A combined fit of the form c/i 4 / 3 + dh did not give any improvement over the 6/i 4//3 fit, 
so we did not include it here. The best fit for the exponent gives a value of 7 = 1.32 which is 
consistent with 4/3 within the accuracy of this measurement. In the following we will therefore 
use MSD(h) = bh 4 / 3 with b as the only fit parameter. 

The superdiffusivity of the boundary is not a result of large jumps as for e.g. Levy processes, 
but arises from the continuous growth of the surface roughness. Therefore, although superdiffu- 
sive, the sector boundaries are still continuous paths on a large scale, which is clearly supported by 
Fig. [2] As a further characterisic, Fig. [5] (right) shows regularized histograms for the distributions 



7 




Figure 5: Sector boundaries behave like time changed Brownian motions, confirming the model (|6jl. Left: 
We plot the renormalized covariances |7} as a function of h on a double logarithmic scale for three different 
values of the lag r. The data (symbols) are averaged over 1000 realizations and agree well with the expected 
linear behaviour h (full line). Right: Regularized histograms approximating the pdf of are generated 
from 1000 realizations, and plotted for three heights feon a logarithmic scale. They agree well with the 
corresponding Gaussian fit (full lines), deviations for large displacements are due to the finite number of 
samples. 



of the displacements for three different values of h which are well approximated by Gaussian 
probability density functions (pdf 's). The simplest continuous model for the sector boundaries 
with the correct mean square displacement and Gaussian pdf 's is a time-changed Brownian mo- 
tion. Thus we take 

Xh, = Bmsd(K) j (6) 

where (Bt : t > 0) is a standard Brownian motion with mean square displacement ) = t and 
Bq = (see [22] Chapter 1.1 for details and formal definitions). Under this assumption we expect 
the covariances 

{X h X h+T ) = (BMSD(h) B MSD(h+T)) = MSD(h) , (7) 

to be independent of the increment r > 0, which is a standard property of Brownian motions (cf. 
ll22l Chapter 1.1). This is confirmed very well by simulation data shown in Fig. [5] (left), where the 
covariances for several values of r follow the data for the mean square displacement taken from 
Fig. [4} Therefore our data are in very good agreement with the model ([6]). 

We would like to remark that there are of course many other continuous Gaussian processes 
such as scaled Brownian motions or fractional Brownian motions as possible models for the do- 
main boundaries. These are characterized by different covariance functions, and none of them 
showed a better agreement than our model (|6]) in Fig. [5] (left). The reason we chose ([6]> is that it 
explains the data very well and is at the same time mathematically easy to treat. 

4 Main results on sector statistics 
4.1 Simulation data on sector statistics 

In this section we will discuss aspects of the sector distributions, focusing on the average (M) and 
the second moment (M 2 ) of the number of sectors M at given times. These statistics are gathered 
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Figure 6: Double logarithmic plot of the average number of sectors (Ml) in the linear model, as a function 
of height h rescaled according to |9]). Data points are averaged over 100 samples. The data for the widths 
w = 280 and w = 564 collapse under height rescaling as expected. The straight lines indicate the expected 
power law /i~ 2 / 3 (Jijl and the limiting behaviour due to fixation. 

for both the circular and linear colony simulations, where in the first case M is a function of height 
h and in the second case of radius r. Measurements for the circular colony simulations are made 
for the initial radius values tq = 50 and ro = 100, and each simulation is performed until the 
radius of the population has increased to roughly 7ro- For the linear colony simulations we take 
two corresponding values of the strip width w = 280 and 564 so that the initial population fronts 
have the same size. These are smaller than 2irrQ due to lattice effects. 

We have seen in the previous section, that the sector boundaries can be modeled by time- 
changed Brownian motions. It is clear from Fig. [2] that when two boundaries meet, the sector 
they enclose will be frozen out and the boundaries annihilate. In the following we will further 
assume that annihilation is the only interaction between the boundaries, i.e. they move freely 
until they meet, which is a good first approximation according to our simulation data. For the 
linear case we are just looking at a set of annihilating Brownian motions in one dimension, which 
is well understood theoretically lfl~6l [T71 . In particular it is known that the average number of 
sectors decays like 1 / y/MSD(h) with height h. This exact result can be understood heuristic ally, 
since the the rate at which boundaries meet and annihilate is proportional to the typical range of 
a boundary covered up to height h, which is given by the root mean square displacement. 
Therefore we expect the number of sectors to decay as 

(M L (h)) ~ \/y/MSD{h) ~ hT 2 ' 3 , (8) 

due to the superdiffusive behaviour, as opposed to h^ 1 / 2 for standard Brownian motions. This 
prediction is confirmed by simulation data in Fig. [6] We show a double logarithmic plot of the 
average number of sectors (Ml), and the data follow a power law with exponent —2/3 very 
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Figure 7: Double logarithmic plot of (Mc) measured for the circular model as a function of radius r with 
initial values ro = 50 and vq = 100. Data points are averaged over 100 samples. Initially (Mc(tq)) — tttq 
and due to the coarsening process Mc{r) decays and converges to a stationary value > 1 which depends 
on r . 



closely in the intermediate scaling regime. For large height h the system leaves the power law 
regime and saturates, reaching fixation where (Ml{K)) — > 1 with only one sector remaining. The 
typical scale r to reach fixation is determined by 

y/2MSD{r) = V2&T 2 / 3 = w/2 => r=(-^=) 3/2 , (9) 



when two boundaries of maximal initial distance w/2 meet. Indeed, rescaling of the /i-axis by the 
characteristic scale r in Fig.[6]leads to a very good data collapse for different strip widths w « 280 
and w « 564, and in both cases (Ml) reaches the limiting value 1 just for h/r = 1. 

The same data (Mc(r)) as a function of radius r are presented in Fig. [7] for the circular 
experiment with two different intial radii ro = 50 and vq = 100. We see that the functional form 
of the decay is not a simple power law in this case, and that for large radius (Mc(r)) stays larger 
than 1. So in contrast to the linear model which fixates with probability one, the circular system 
in general does not fixate. Due to the range expansion the size of the colony boundary increases 
linearly in r and this is faster then the decrease in sector numbers due to coarsening, which is 
of order r~ 2 / 3 . This slows down the decay in the sector numbers until finally it converges to a 
stationary value, which depends on the initial radius ro- Rather than trying to understand these 
features of the circular experiment independently (which is of course possible), we will explain 
in the next section that all its statistical properties are completely determined by the linear model. 
The geometric effects due to the range expansion can be summarized in a proper mapping between 
the radius r and heights h. 
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4.2 Connection between circular and linear model 



To see how the measurements for the two frontier shapes are related, we rescale the statistics of the 
circular frontier simulation to turn them into a function of an effective height h(r), which compares 
directly with the statistics of the linear frontier. To do this we first need to find a relationship 
between a sector boundary X r in the circular colonial simulation with an increasing frontier size, 
and a domain boundary in the linear colonial simulation with a fixed frontier size. Consider 
an interval [0, L(r)] with periodic boundary conditions, which undergoes a space expansion with 
increasing r, in our case L(r) = 2irr and initialy we have L[tq) = 27rro = it;. In the abscence of 
diffusion, the relative position of the boundary in the interval should remain fixed, corresponding 
to a uniform expansion of space. So the rescaled process Y r = L ^°^ X r on the fixed interval 
[0, L(ro)] does not change in time, i.e. 

dY r = ^dX r -X r I ^pldr = 0. (10) 
L(r) L{r) z 

This will fix the determistic speed of X r due to space expansion and leads to the following SDE 
for X r : 

L'(r) 

dX r = X r —^-dr + a(r-r )dB r for r > r , (11) 
L(r) 

where the fluctuations are determined by the model B MSD ( r - rQ ) for a single boundary, as estab- 
lished in (|6]>. Here we use the incremental form 

dBMSD(r-r ) = a ( r ~ r o) dB r with diffusivity <j(r) = \/MSD'(r) , (12) 
with a standard Brownian motion B r for r > ro and B r = 0. In the circular case the role of 



height is taken over by the radius r — ro relative to the initial condition. Substituting (11) into ( 10 1 
leads to the following simple SDE for the rescaled process Y r , 

L{r )a{r -r ) r 

dY r = — dB r = — air — ro) dB r . (13) 

L[r) r 

Therefore, Y r is a continuous martingale (see e.g. [22], Chapter II.5) and can be written as a 
stochastic integral 

Y r = [ ^a(s-r )dB s . (14) 

Jr s 

The mean square displacement of the process Y r is given by the quadratic variation 

pr 2 

MSD{r) = (Y r 2 ) = / T -\ a(s - r ) 2 ds , (15) 
Jr s 

which is a monotone function of r. Also the linear boundary dXh = cr(h)dBh for h > is 
modelled by a continuous martingale as derived in Section 3.2, so both are time changes of stan- 
dard Brownian motions (see e.g. [22J, Chapter II.5). Therefore, under a proper mapping h(r) 
the circular and linear boundaries are statistically equivalent processes, i.e. (Y r : r > ro) and 
(X/j( r ) : r > ro) have the same distribution. The correct time change can be found by equating 
the mean square displacement ( [T5] > 

MSD(r) = bK 1 (16) 
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Figure 8: The mapping between the circular and the linear model as given by h(r) in ( 17 1. Top: Double 
logarithmic plot of h(r — ro) for ro = 50, 100 and 7 = 4/3 (full lines), and without superdiffusion for 
ro = 50 and 7 = 1 (broken line) for comparison. Bottom: Convergence of h(r) — > hoo as r — > 00 explains 
that the circular model (left) exhibits a non-trivial stationary statistic given by the line on the right, 
where typically more than one sector survives. 
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with that of a linear boundary as derived in Section 3.2. This leads to the following relationship 
h(r) between height and radius, 



h(r) 



-| a(s - r ) 2 ds 



»■() 



" 1/7 


r 7 




1 




L -Jr 



) 7 ~ x ds 



l/ 7 



(17) 



where we have used a(s — ro) 2 = MSD'(s — ro) = &7(s — ro) 7-1 . This function depends on 
the initial radius ro and on the exponent of the mean square fluctuations 7 which is 4/3 in our 
case, but not on the prefactor b. For small r — ro we can neglect the term (ro/s) 2 in the integral 
and see that h(r — ro) ~ r — ro is approximately linear. This is to be expected, since close to 
the initial conditions the linear and circular geometry are locally equivalent and Y r - ro « X r . On 
the other hand, it is clear that the integral in ( 17 1 converges as r — > 00 to a finite number 
depending on 7 and ro. In fact, h{r) can be written for 7 > 1 in terms of the incomplete beta 
function B z (a, b) = s^ 1 (1 - sf' 1 ds as 



h(r) = r 



^7(1 ~ 7) 
sin(7T7) 



7-^ro /r 



7,7 



i/ 7 



(18) 



The first term determines the limit as r — > 00, and for 7 = 4/3 its numerical value is given by 
/ioo = 1.43 ro- On the other hand, for the diffusive case with 7 = 1 the integral (17) can be 
evaluated easily and we get the even simpler expression 



h(r) = r (l- 



ro as r — > 00 



(19) 



These results are illustrated in Fig.[8](top), where we plot h(r — ro) on a double logarithmic scale 
for the values of interest in our simulations, ro = 50, 100 and 7 = 4/3. For comparison we also 
show 7 = 1 and ro = 50 and see that neglecting the superdiffusivity of the sector boundaries 
would lead to a significant change in the mapping. 

To compare the coarsening process of sector boundaries, note that the circular boundaries X r 
meet and annihilate if and only if the rescaled boundaries Y r do so. Since the latter are statistically 
equivalent to the linear boundaries X^, the same is true for the statistics of the coarsening process. 
In fact, using the mapping h(r) the whole circular experiment can be mapped onto and fully 
understood by the linear one. This is illustrated in Fig. [8] (bottom), and in Fig.[9](left) we show the 
rescaled circular data (h(r), (Mc(r)}) and the linear data (h, (Mx,(/i))) for the average number 
of sectors in the same plot. We see a very good agreement for these statistics, and knowledge 
of the limit = 1.43 ro for r — > 00 can be used to determine the limiting average number 
of sectors (Mc(oo)) in the circular experiment. This turns out to be around 7 for ro = 50 and 
around 8.8 for ro = 100. Also higher order moments of the number of sectors lead to a very good 
agreement (see Fig. [9] on the right for the second moment), confirming our result of a complete 
statistical equivalence of the two experiments. 

The linear model reaches fixation at heights of order r ~ w 3 ^ 2 as derived in ^ from the 
finite width w of the strip and the sector boundary fluctuations of order /1 2 / 3 . The circular model 
undergoes a range expansion where the size of the colony frontier grows linearly with radius r 
due to the circular geometry. This growth is faster than the fluctuations of the sector boundaries of 
order (r — ro) 2 / 3 , so that fixation is typically not reached. For the rescaled sector boundaries Y r 
the range expansion leads to an effective decrease in the diffusivity a(r — ro)/r as seen in ( 13 ). 
This effect is quantitatively characterized by convergence of the function 



K r ) ~+ fr-oo ~ r =w/ (2tt) 



as r 



00 



(20) 



13 




Figure 9: The mapping between the circular and the linear model applied to our simulation data for 
ro = 50 and ro = 100 in a double logarithmic plot. Left: The linear data (M^h)) are shown by full lines, 
and the corresponding circular data by symbols, where we plot (Mc(r)) against h(r) according to (17i. 
The limiting values /i^ as r — > oo are indicated by dashed lines and allow to determine the number of 
sectors (Mc (oo)) for large distances. The data are averaged over 100 realizations. Right: Analogous plot 
for the second moment (M 2 ). 



where the limit is of order ro with only the prefactor depending on 7, as we have seen above. 
The stationary statistics of the circular sectors at large distance is given by statistics of the linear 
sectors at height /i m ~ w, so the probability of reaching fixation in the circular model essentially 
vanishes for large w. This can be seen in Fig.[9j where the average number of sectors in the circular 
experiments are significantly larger than 1 (which would correspond to fixation). 



5 Discussion 

We have presented numerical evidence, that the sector boundaries of segregation patterns on ex- 
panding population fronts can be modelled by time changes of Brownian motion. This lead to a 
complete understanding of the sector statistics for expanding circular geometries in terms of lin- 
ear growth models with fixed width, which have been studied in great detail in the mathematics 



literature. The mapping ( 17 1 implies that h{r) converges to a finite limit as r — > 00 as long as 

MSD'(r)/r 2 ~ r 7 " 3 « 1/r forr^oo, (21) 

which holds for all 7 < 2. So as long as the motion of sector boundaries is not ballistic (i.e. 7 = 
2), expansion will dominate the sector coarsening process and there is no fixation for expanding 
population fronts in a circular geometry. Ballistic sector boundaries can result from different 
fitnesses modelled e.g. by different reproduction rates r& and rg for the species. In that case the 
effects of circular range expansion and coarsening are on the same scale, which leads to interesting 
competition effects lfT2l[T3ll . On the other hand, the strength of the range expansion effects can also 
be affected by non-circular geometries which lead to a non-linear growth of the population front 
with distance. This could be the result of certain geometrical constraints in the growth medium 
such as a landscape, which can therefore affect the biodiversity at the population front. We still 
expect that such systems can be understood by a mapping to the linear experiment analogous to 
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the one presented in this paper. In general, the linear model provides the prototype for the pure 
coarsening effects due to genetic drift, and the full behaviour is determined by an interplay with 
geometric effects, which can be summarized in an effective mapping. 

The mathematical basis of this argument is that all continuous martingales are time-changed 
Brownian motions. Other possible models for the sector boundaries such as scaled Brownian 
motions or fractional Brownian motions are not martingales and the argument as presented here 
does not strictly apply. Nevertheless, a mapping between different geometries similar to the one 
presented here should be justifyable at least heuristically also for other boundary models, since the 
sector boundaries do not exhibit large jumps or interact on macroscopic scales. These conditions 
are fulfilled in general as long as the species reproduces locally and does not migrate on a time 
scale faster than that of reproduction. Note also that the sector boundaries inside the population do 
not have to be static in more general situations. But as long as the range expansion is fast enough, 
changes due to death, migration or reproduction inside the population will not affect the behaviour 
at the population front. So our approach is widely applicable for locally reproducing species under 
range expansions, as long as boundary growth is the fastest relevant process in the system. 

According to the well established paradigm of the KPZ universality class lfl31 . the scaling 
behaviour of the roughness S(h) ~ /i 1 / 3 of the population front ^ is largely independent on the 
microscopic details of the growth rules. This holds as long as spatial and temporal correlation 
lengths, resulting from these details or external effects from the environment, are small compared 
to the system size and growth time, respectively. If this is the case, the exponent 7 of the path of 
sector boundaries is therefore not affected by details of the mode of reproduction of a single indi- 
viduum, which will only influence the prefactor b (cf. Equ. ([5])). This determines the width of the 
sectors in the linear geometry in a very simple fashion (cf. Equ. (|9])), whereas our main result, the 
mapping h(r), is independent of b and therefore applicable to a large class of locally reproducing 
species. The lattice based simulation model presented in this paper is only a simple caricature of 
that class, where we have assumed that cells divide in a spatially homogeneous fashion and follow 
a continuous time Markov process, i.e. their reproduction time has an exponential distribution. 
Although these assumptions do not hold for E. coli, the correlation lengths referred to above are 
quite small and our simulations are therefore a good coarse grained model on a macroscopic level 
(cf. Fig. [TJ. It is quantitatively close, but to get the best possible agreement one would have to 
fit b from a larger set of experimental data. It is of course an intriguing problem to predict the 
parameter b from the microscopic details of the mode of replication, and we are currently working 
on a flexible off-lattice simulation model that will allow us to investigate this in detail. A first 
interesting example for comparison was also reported in [2], namely that for yeast (S. cerevisiae) 
the sectoring patterns are significantly finer than the ones observed for E. coli, corresponding to a 
reduced mobility of the boundaries and a smaller value of b. This could be the result of the larger 
size of yeast, but temporal aspects of the reproduction are also expected to play a major role. 
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